Overview
Dataset statistics
| Number of variables | 25 |
|---|---|
| Number of observations | 6807 |
| Missing cells | 2471 |
| Missing cells (%) | 1.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 3.3 MiB |
| Average record size in memory | 514.5 B |
Variable types
| Numeric | 17 |
|---|---|
| Unsupported | 5 |
| Text | 1 |
| DateTime | 2 |
item is highly overall correlated with activity_days and 2 other fields | High correlation |
review_count is highly overall correlated with item_user_heavy_ratio and 1 other fields | High correlation |
year is highly overall correlated with activity_days and 2 other fields | High correlation |
activity_span is highly overall correlated with item and 3 other fields | High correlation |
activity_days is highly overall correlated with item and 2 other fields | High correlation |
start_year is highly overall correlated with activity_days and 2 other fields | High correlation |
end_year is highly overall correlated with end_month | High correlation |
end_month is highly overall correlated with end_year | High correlation |
item_user_median_activity is highly overall correlated with item_user_heavy_ratio and 1 other fields | High correlation |
item_user_heavy_ratio is highly overall correlated with item_user_median_activity and 1 other fields | High correlation |
directors has 1304 (19.2%) missing values | Missing |
writers has 1159 (17.0%) missing values | Missing |
item has unique values | Unique |
genres is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
directors is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
writers is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
activity_span is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
genre_combo is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
start_weekday has 1670 (24.5%) zeros | Zeros |
start_hour has 277 (4.1%) zeros | Zeros |
end_weekday has 1553 (22.8%) zeros | Zeros |
end_hour has 308 (4.5%) zeros | Zeros |
n_writers has 1159 (17.0%) zeros | Zeros |
n_directors has 1304 (19.2%) zeros | Zeros |
Reproduction
| Analysis started | 2025-12-22 19:05:51.175519 |
|---|---|
| Analysis finished | 2025-12-22 19:06:23.506012 |
| Duration | 32.33 seconds |
| Software version | ydata-profiling vv4.18.0 |
| Download configuration | config.json |
Variables
item
Real number (ℝ)
High correlation Unique
| Distinct | 6807 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26626.948 |
| Minimum | 1 |
|---|---|
| Maximum | 119145 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 53.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 556.6 |
| Q1 | 3054.5 |
| median | 6882 |
| Q3 | 49823 |
| 95-th percentile | 94054.4 |
| Maximum | 119145 |
| Range | 119144 |
| Interquartile range (IQR) | 46768.5 |
Descriptive statistics
| Standard deviation | 32194.062 |
|---|---|
| Coefficient of variation (CV) | 1.2090782 |
| Kurtosis | -0.11608162 |
| Mean | 26626.948 |
| Median Absolute Deviation (MAD) | 5608 |
| Skewness | 1.1011684 |
| Sum | 1.8124964 × 108 |
| Variance | 1.0364577 × 109 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 119145 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| Other values (6797) | 6797 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 119145 | 1 | |
| 119141 | 1 | |
| 118997 | 1 | |
| 118900 | 1 | |
| 118700 | 1 | |
| 118696 | 1 | |
| 117881 | 1 | |
| 117533 | 1 | |
| 117176 | 1 | |
| 116823 | 1 |
review_count
Real number (ℝ)
High correlation
| Distinct | 1849 |
|---|---|
| Distinct (%) | 27.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 757.23094 |
| Minimum | 27 |
|---|---|
| Maximum | 19699 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 53.3 KiB |
Quantile statistics
| Minimum | 27 |
|---|---|
| 5-th percentile | 52 |
| Q1 | 90 |
| median | 197 |
| Q3 | 610.5 |
| 95-th percentile | 3312.1 |
| Maximum | 19699 |
| Range | 19672 |
| Interquartile range (IQR) | 520.5 |
Descriptive statistics
| Standard deviation | 1682.9731 |
|---|---|
| Coefficient of variation (CV) | 2.2225361 |
| Kurtosis | 32.335437 |
| Mean | 757.23094 |
| Median Absolute Deviation (MAD) | 133 |
| Skewness | 5.0438171 |
| Sum | 5154471 |
| Variance | 2832398.4 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 50 | 62 | 0.9% |
| 58 | 54 | 0.8% |
| 52 | 52 | 0.8% |
| 56 | 51 | 0.7% |
| 48 | 50 | 0.7% |
| 55 | 46 | 0.7% |
| 54 | 45 | 0.7% |
| 59 | 44 | 0.6% |
| 49 | 44 | 0.6% |
| 51 | 43 | 0.6% |
| Other values (1839) | 6316 |
| Value | Count | Frequency (%) |
| 27 | 1 | < 0.1% |
| 34 | 2 | < 0.1% |
| 36 | 1 | < 0.1% |
| 38 | 2 | < 0.1% |
| 39 | 3 | < 0.1% |
| 40 | 6 | |
| 41 | 8 | |
| 42 | 7 | |
| 43 | 8 | |
| 44 | 12 |
| Value | Count | Frequency (%) |
| 19699 | 1 | |
| 18437 | 1 | |
| 18202 | 1 | |
| 18168 | 1 | |
| 17339 | 1 | |
| 17237 | 1 | |
| 16656 | 1 | |
| 16387 | 1 | |
| 15847 | 1 | |
| 15213 | 1 |
genres
Unsupported
Rejected Unsupported
| Missing | 0 |
|---|---|
| Missing (%) | 0.0% |
| Memory size | 579.4 KiB |
directors
Unsupported
Missing Rejected Unsupported
| Missing | 1304 |
|---|---|
| Missing (%) | 19.2% |
| Memory size | 472.1 KiB |
writers
Unsupported
Missing Rejected Unsupported
| Missing | 1159 |
|---|---|
| Missing (%) | 17.0% |
| Memory size | 505.4 KiB |
year
Real number (ℝ)
High correlation
| Distinct | 93 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 8 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1992.1747 |
| Minimum | 1922 |
|---|---|
| Maximum | 2014 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 53.3 KiB |
Quantile statistics
| Minimum | 1922 |
|---|---|
| 5-th percentile | 1950 |
| Q1 | 1985 |
| median | 1999 |
| Q3 | 2006 |
| 95-th percentile | 2012 |
| Maximum | 2014 |
| Range | 92 |
| Interquartile range (IQR) | 21 |
Descriptive statistics
| Standard deviation | 19.052568 |
|---|---|
| Coefficient of variation (CV) | 0.0095637034 |
| Kurtosis | 1.2781437 |
| Mean | 1992.1747 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | -1.3709319 |
| Sum | 13544796 |
| Variance | 363.00036 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2006 | 284 | 4.2% |
| 2007 | 271 | 4.0% |
| 2004 | 262 | 3.8% |
| 2005 | 260 | 3.8% |
| 2008 | 256 | 3.8% |
| 2009 | 247 | 3.6% |
| 2002 | 242 | 3.6% |
| 2003 | 236 | 3.5% |
| 2001 | 222 | 3.3% |
| 2010 | 213 | 3.1% |
| Other values (83) | 4306 |
| Value | Count | Frequency (%) |
| 1922 | 3 | < 0.1% |
| 1923 | 2 | < 0.1% |
| 1924 | 5 | |
| 1925 | 6 | |
| 1926 | 2 | < 0.1% |
| 1927 | 5 | |
| 1928 | 6 | |
| 1929 | 5 | |
| 1930 | 4 | 0.1% |
| 1931 | 10 |
| Value | Count | Frequency (%) |
| 2014 | 76 | 1.1% |
| 2013 | 134 | |
| 2012 | 140 | |
| 2011 | 185 | |
| 2010 | 213 | |
| 2009 | 247 | |
| 2008 | 256 | |
| 2007 | 271 | |
| 2006 | 284 | |
| 2005 | 260 |
title
Text
| Distinct | 6806 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 569.4 KiB |
Length
| Max length | 134 |
|---|---|
| Median length | 98 |
| Mean length | 26.000588 |
| Min length | 8 |
Unique
| Unique | 6805 ? |
|---|---|
| Unique (%) | > 99.9% |
Sample
| 1st row | Toy Story (1995) |
|---|---|
| 2nd row | Jumanji (1995) |
| 3rd row | Grumpier Old Men (1995) |
| 4th row | Waiting to Exhale (1995) |
| 5th row | Father of the Bride Part II (1995) |
| Value | Count | Frequency (%) |
| the | 2387 | 8.2% |
| of | 741 | 2.5% |
| a | 302 | 1.0% |
| 2006 | 284 | 1.0% |
| 2007 | 271 | 0.9% |
| and | 263 | 0.9% |
| 2004 | 262 | 0.9% |
| 2005 | 260 | 0.9% |
| 2008 | 256 | 0.9% |
| 2009 | 247 | 0.8% |
| Other values (7665) | 24006 |
Most occurring characters
| Value | Count | Frequency (%) |
| 22475 | 12.7% | |
| e | 12541 | 7.1% |
| a | 8163 | 4.6% |
| ( | 7763 | 4.4% |
| ) | 7763 | 4.4% |
| o | 7430 | 4.2% |
| n | 6984 | 3.9% |
| i | 6683 | 3.8% |
| r | 6587 | 3.7% |
| 0 | 6492 | 3.7% |
| Other values (114) | 84105 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 176986 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 22475 | 12.7% | |
| e | 12541 | 7.1% |
| a | 8163 | 4.6% |
| ( | 7763 | 4.4% |
| ) | 7763 | 4.4% |
| o | 7430 | 4.2% |
| n | 6984 | 3.9% |
| i | 6683 | 3.8% |
| r | 6587 | 3.7% |
| 0 | 6492 | 3.7% |
| Other values (114) | 84105 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 176986 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 22475 | 12.7% | |
| e | 12541 | 7.1% |
| a | 8163 | 4.6% |
| ( | 7763 | 4.4% |
| ) | 7763 | 4.4% |
| o | 7430 | 4.2% |
| n | 6984 | 3.9% |
| i | 6683 | 3.8% |
| r | 6587 | 3.7% |
| 0 | 6492 | 3.7% |
| Other values (114) | 84105 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 176986 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 22475 | 12.7% | |
| e | 12541 | 7.1% |
| a | 8163 | 4.6% |
| ( | 7763 | 4.4% |
| ) | 7763 | 4.4% |
| o | 7430 | 4.2% |
| n | 6984 | 3.9% |
| i | 6683 | 3.8% |
| r | 6587 | 3.7% |
| 0 | 6492 | 3.7% |
| Other values (114) | 84105 |
start_datetime
Date
| Distinct | 6796 |
|---|---|
| Distinct (%) | 99.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.3 KiB |
| Minimum | 2005-04-11 11:56:25 |
|---|---|
| Maximum | 2015-01-03 05:32:06 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
end_datetime
Date
| Distinct | 6787 |
|---|---|
| Distinct (%) | 99.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 53.3 KiB |
| Minimum | 2010-12-09 13:41:09 |
|---|---|
| Maximum | 2015-03-31 05:50:52 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
activity_span
Unsupported
High correlation Rejected Unsupported
| Missing | 0 |
|---|---|
| Missing (%) | 0.0% |
| Memory size | 53.3 KiB |
activity_days
Real number (ℝ)
High correlation
| Distinct | 1941 |
|---|---|
| Distinct (%) | 28.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3043.4441 |
| Minimum | 85 |
|---|---|
| Maximum | 3640 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 53.3 KiB |
Quantile statistics
| Minimum | 85 |
|---|---|
| 5-th percentile | 1031.3 |
| Q1 | 2584.5 |
| median | 3575 |
| Q3 | 3631 |
| 95-th percentile | 3639 |
| Maximum | 3640 |
| Range | 3555 |
| Interquartile range (IQR) | 1046.5 |
Descriptive statistics
| Standard deviation | 886.97684 |
|---|---|
| Coefficient of variation (CV) | 0.29143852 |
| Kurtosis | 1.1088977 |
| Mean | 3043.4441 |
| Median Absolute Deviation (MAD) | 63 |
| Skewness | -1.4812014 |
| Sum | 20716724 |
| Variance | 786727.92 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3639 | 267 | 3.9% |
| 3637 | 230 | 3.4% |
| 3638 | 227 | 3.3% |
| 3640 | 192 | 2.8% |
| 3636 | 175 | 2.6% |
| 3632 | 149 | 2.2% |
| 3635 | 134 | 2.0% |
| 3633 | 130 | 1.9% |
| 3634 | 125 | 1.8% |
| 3631 | 110 | 1.6% |
| Other values (1931) | 5068 |
| Value | Count | Frequency (%) |
| 85 | 1 | |
| 91 | 1 | |
| 93 | 1 | |
| 95 | 1 | |
| 100 | 1 | |
| 102 | 1 | |
| 109 | 1 | |
| 112 | 2 | |
| 120 | 1 | |
| 129 | 1 |
| Value | Count | Frequency (%) |
| 3640 | 192 | |
| 3639 | 267 | |
| 3638 | 227 | |
| 3637 | 230 | |
| 3636 | 175 | |
| 3635 | 134 | |
| 3634 | 125 | |
| 3633 | 130 | |
| 3632 | 149 | |
| 3631 | 110 |
start_year
Real number (ℝ)
High correlation
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2006.4147 |
| Minimum | 2005 |
|---|---|
| Maximum | 2015 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.7 KiB |
Quantile statistics
| Minimum | 2005 |
|---|---|
| 5-th percentile | 2005 |
| Q1 | 2005 |
| median | 2005 |
| Q3 | 2008 |
| 95-th percentile | 2012 |
| Maximum | 2015 |
| Range | 10 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.3578059 |
|---|---|
| Coefficient of variation (CV) | 0.0011751339 |
| Kurtosis | 1.4053833 |
| Mean | 2006.4147 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.5795055 |
| Sum | 13657665 |
| Variance | 5.5592485 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2005 | 4481 | |
| 2008 | 429 | 6.3% |
| 2009 | 390 | 5.7% |
| 2006 | 302 | 4.4% |
| 2007 | 299 | 4.4% |
| 2010 | 283 | 4.2% |
| 2011 | 227 | 3.3% |
| 2012 | 161 | 2.4% |
| 2013 | 137 | 2.0% |
| 2014 | 97 | 1.4% |
| Value | Count | Frequency (%) |
| 2005 | 4481 | |
| 2006 | 302 | 4.4% |
| 2007 | 299 | 4.4% |
| 2008 | 429 | 6.3% |
| 2009 | 390 | 5.7% |
| 2010 | 283 | 4.2% |
| 2011 | 227 | 3.3% |
| 2012 | 161 | 2.4% |
| 2013 | 137 | 2.0% |
| 2014 | 97 | 1.4% |
| Value | Count | Frequency (%) |
| 2015 | 1 | < 0.1% |
| 2014 | 97 | 1.4% |
| 2013 | 137 | 2.0% |
| 2012 | 161 | 2.4% |
| 2011 | 227 | |
| 2010 | 283 | |
| 2009 | 390 | |
| 2008 | 429 | |
| 2007 | 299 | |
| 2006 | 302 |
start_month
Real number (ℝ)
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.0832966 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 4 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 11 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 2.5059955 |
|---|---|
| Coefficient of variation (CV) | 0.49298628 |
| Kurtosis | 1.1514156 |
| Mean | 5.0832966 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.3554296 |
| Sum | 34602 |
| Variance | 6.2800133 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 4006 | |
| 5 | 534 | 7.8% |
| 6 | 276 | 4.1% |
| 11 | 255 | 3.7% |
| 10 | 254 | 3.7% |
| 1 | 241 | 3.5% |
| 7 | 218 | 3.2% |
| 12 | 215 | 3.2% |
| 9 | 211 | 3.1% |
| 8 | 210 | 3.1% |
| Other values (2) | 387 | 5.7% |
| Value | Count | Frequency (%) |
| 1 | 241 | 3.5% |
| 2 | 180 | 2.6% |
| 3 | 207 | 3.0% |
| 4 | 4006 | |
| 5 | 534 | 7.8% |
| 6 | 276 | 4.1% |
| 7 | 218 | 3.2% |
| 8 | 210 | 3.1% |
| 9 | 211 | 3.1% |
| 10 | 254 | 3.7% |
| Value | Count | Frequency (%) |
| 12 | 215 | 3.2% |
| 11 | 255 | 3.7% |
| 10 | 254 | 3.7% |
| 9 | 211 | 3.1% |
| 8 | 210 | 3.1% |
| 7 | 218 | 3.2% |
| 6 | 276 | 4.1% |
| 5 | 534 | 7.8% |
| 4 | 4006 | |
| 3 | 207 | 3.0% |
start_weekday
Real number (ℝ)
Zeros
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.3246658 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 1670 |
| Zeros (%) | 24.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.9386903 |
|---|---|
| Coefficient of variation (CV) | 0.83396519 |
| Kurtosis | -1.0516509 |
| Mean | 2.3246658 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.38000105 |
| Sum | 15824 |
| Variance | 3.7585202 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1670 | |
| 1 | 1126 | |
| 2 | 1046 | |
| 3 | 923 | |
| 4 | 901 | |
| 5 | 613 | 9.0% |
| 6 | 528 | 7.8% |
| Value | Count | Frequency (%) |
| 0 | 1670 | |
| 1 | 1126 | |
| 2 | 1046 | |
| 3 | 923 | |
| 4 | 901 | |
| 5 | 613 | 9.0% |
| 6 | 528 | 7.8% |
| Value | Count | Frequency (%) |
| 6 | 528 | 7.8% |
| 5 | 613 | 9.0% |
| 4 | 901 | |
| 3 | 923 | |
| 2 | 1046 | |
| 1 | 1126 | |
| 0 | 1670 |
start_hour
Real number (ℝ)
Zeros
| Distinct | 24 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.372851 |
| Minimum | 0 |
|---|---|
| Maximum | 23 |
| Zeros | 277 |
| Zeros (%) | 4.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 5 |
| median | 14 |
| Q3 | 19 |
| 95-th percentile | 23 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 7.2274018 |
|---|---|
| Coefficient of variation (CV) | 0.58413389 |
| Kurtosis | -1.3091039 |
| Mean | 12.372851 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | -0.22460993 |
| Sum | 84222 |
| Variance | 52.235337 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 21 | 425 | 6.2% |
| 16 | 424 | 6.2% |
| 18 | 392 | 5.8% |
| 20 | 360 | 5.3% |
| 15 | 356 | 5.2% |
| 23 | 347 | 5.1% |
| 14 | 327 | 4.8% |
| 2 | 323 | 4.7% |
| 19 | 320 | 4.7% |
| 17 | 311 | 4.6% |
| Other values (14) | 3222 |
| Value | Count | Frequency (%) |
| 0 | 277 | |
| 1 | 257 | |
| 2 | 323 | |
| 3 | 285 | |
| 4 | 279 | |
| 5 | 288 | |
| 6 | 259 | |
| 7 | 208 | |
| 8 | 251 | |
| 9 | 174 |
| Value | Count | Frequency (%) |
| 23 | 347 | |
| 22 | 289 | |
| 21 | 425 | |
| 20 | 360 | |
| 19 | 320 | |
| 18 | 392 | |
| 17 | 311 | |
| 16 | 424 | |
| 15 | 356 | |
| 14 | 327 |
end_year
Real number (ℝ)
High correlation
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2014.8781 |
| Minimum | 2010 |
|---|---|
| Maximum | 2015 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.7 KiB |
Quantile statistics
| Minimum | 2010 |
|---|---|
| 5-th percentile | 2014 |
| Q1 | 2015 |
| median | 2015 |
| Q3 | 2015 |
| 95-th percentile | 2015 |
| Maximum | 2015 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.39516535 |
|---|---|
| Coefficient of variation (CV) | 0.0001961237 |
| Kurtosis | 19.852534 |
| Mean | 2014.8781 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -3.9367959 |
| Sum | 13715275 |
| Variance | 0.15615565 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2015 | 6114 | |
| 2014 | 580 | 8.5% |
| 2013 | 94 | 1.4% |
| 2012 | 15 | 0.2% |
| 2011 | 3 | < 0.1% |
| 2010 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2010 | 1 | < 0.1% |
| 2011 | 3 | < 0.1% |
| 2012 | 15 | 0.2% |
| 2013 | 94 | 1.4% |
| 2014 | 580 | 8.5% |
| 2015 | 6114 |
| Value | Count | Frequency (%) |
| 2015 | 6114 | |
| 2014 | 580 | 8.5% |
| 2013 | 94 | 1.4% |
| 2012 | 15 | 0.2% |
| 2011 | 3 | < 0.1% |
| 2010 | 1 | < 0.1% |
end_month
Real number (ℝ)
High correlation
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.3261349 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 3 |
| Q3 | 3 |
| 95-th percentile | 11 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.2485827 |
|---|---|
| Coefficient of variation (CV) | 0.67603473 |
| Kurtosis | 7.4547036 |
| Mean | 3.3261349 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.8111663 |
| Sum | 22641 |
| Variance | 5.0561241 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 4721 | |
| 2 | 921 | 13.5% |
| 1 | 540 | 7.9% |
| 12 | 187 | 2.7% |
| 11 | 156 | 2.3% |
| 10 | 61 | 0.9% |
| 7 | 54 | 0.8% |
| 9 | 46 | 0.7% |
| 8 | 40 | 0.6% |
| 6 | 32 | 0.5% |
| Other values (2) | 49 | 0.7% |
| Value | Count | Frequency (%) |
| 1 | 540 | 7.9% |
| 2 | 921 | 13.5% |
| 3 | 4721 | |
| 4 | 23 | 0.3% |
| 5 | 26 | 0.4% |
| 6 | 32 | 0.5% |
| 7 | 54 | 0.8% |
| 8 | 40 | 0.6% |
| 9 | 46 | 0.7% |
| 10 | 61 | 0.9% |
| Value | Count | Frequency (%) |
| 12 | 187 | 2.7% |
| 11 | 156 | 2.3% |
| 10 | 61 | 0.9% |
| 9 | 46 | 0.7% |
| 8 | 40 | 0.6% |
| 7 | 54 | 0.8% |
| 6 | 32 | 0.5% |
| 5 | 26 | 0.4% |
| 4 | 23 | 0.3% |
| 3 | 4721 |
end_weekday
Real number (ℝ)
Zeros
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.1355957 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 1553 |
| Zeros (%) | 22.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 6 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.3388981 |
|---|---|
| Coefficient of variation (CV) | 0.74591827 |
| Kurtosis | -1.5599335 |
| Mean | 3.1355957 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.11242667 |
| Sum | 21344 |
| Variance | 5.4704445 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 1705 | |
| 0 | 1553 | |
| 5 | 908 | |
| 1 | 774 | |
| 4 | 753 | |
| 3 | 560 | 8.2% |
| 2 | 554 | 8.1% |
| Value | Count | Frequency (%) |
| 0 | 1553 | |
| 1 | 774 | |
| 2 | 554 | 8.1% |
| 3 | 560 | 8.2% |
| 4 | 753 | |
| 5 | 908 | |
| 6 | 1705 |
| Value | Count | Frequency (%) |
| 6 | 1705 | |
| 5 | 908 | |
| 4 | 753 | |
| 3 | 560 | 8.2% |
| 2 | 554 | 8.1% |
| 1 | 774 | |
| 0 | 1553 |
end_hour
Real number (ℝ)
Zeros
| Distinct | 24 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.132364 |
| Minimum | 0 |
|---|---|
| Maximum | 23 |
| Zeros | 308 |
| Zeros (%) | 4.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 6 |
| median | 15 |
| Q3 | 19 |
| 95-th percentile | 23 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 13 |
Descriptive statistics
| Standard deviation | 7.5027054 |
|---|---|
| Coefficient of variation (CV) | 0.57131416 |
| Kurtosis | -1.2862835 |
| Mean | 13.132364 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | -0.3775747 |
| Sum | 89392 |
| Variance | 56.290588 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 19 | 619 | 9.1% |
| 23 | 463 | 6.8% |
| 18 | 446 | 6.6% |
| 21 | 421 | 6.2% |
| 22 | 419 | 6.2% |
| 20 | 387 | 5.7% |
| 1 | 337 | 5.0% |
| 17 | 322 | 4.7% |
| 0 | 308 | 4.5% |
| 16 | 276 | 4.1% |
| Other values (14) | 2809 |
| Value | Count | Frequency (%) |
| 0 | 308 | |
| 1 | 337 | |
| 2 | 237 | |
| 3 | 256 | |
| 4 | 239 | |
| 5 | 190 | |
| 6 | 246 | |
| 7 | 239 | |
| 8 | 139 | |
| 9 | 132 | 1.9% |
| Value | Count | Frequency (%) |
| 23 | 463 | |
| 22 | 419 | |
| 21 | 421 | |
| 20 | 387 | |
| 19 | 619 | |
| 18 | 446 | |
| 17 | 322 | |
| 16 | 276 | |
| 15 | 257 | |
| 14 | 135 | 2.0% |
item_user_median_activity
Real number (ℝ)
High correlation
| Distinct | 770 |
|---|---|
| Distinct (%) | 11.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 317.11936 |
| Minimum | 127 |
|---|---|
| Maximum | 721.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 53.3 KiB |
Quantile statistics
| Minimum | 127 |
|---|---|
| 5-th percentile | 199 |
| Q1 | 256 |
| median | 305 |
| Q3 | 373.5 |
| 95-th percentile | 468 |
| Maximum | 721.5 |
| Range | 594.5 |
| Interquartile range (IQR) | 117.5 |
Descriptive statistics
| Standard deviation | 83.303291 |
|---|---|
| Coefficient of variation (CV) | 0.26268749 |
| Kurtosis | 0.39401951 |
| Mean | 317.11936 |
| Median Absolute Deviation (MAD) | 56 |
| Skewness | 0.62764842 |
| Sum | 2158631.5 |
| Variance | 6939.4383 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 270 | 41 | 0.6% |
| 275 | 40 | 0.6% |
| 253 | 40 | 0.6% |
| 265 | 40 | 0.6% |
| 260 | 37 | 0.5% |
| 279 | 36 | 0.5% |
| 354 | 36 | 0.5% |
| 271 | 35 | 0.5% |
| 246 | 35 | 0.5% |
| 258 | 35 | 0.5% |
| Other values (760) | 6432 |
| Value | Count | Frequency (%) |
| 127 | 1 | < 0.1% |
| 130 | 1 | < 0.1% |
| 132 | 1 | < 0.1% |
| 134 | 1 | < 0.1% |
| 135 | 2 | < 0.1% |
| 137 | 1 | < 0.1% |
| 138 | 2 | < 0.1% |
| 139 | 2 | < 0.1% |
| 140 | 1 | < 0.1% |
| 143 | 5 |
| Value | Count | Frequency (%) |
| 721.5 | 1 | |
| 708.5 | 1 | |
| 670 | 1 | |
| 648 | 1 | |
| 645 | 1 | |
| 634.5 | 1 | |
| 629.5 | 1 | |
| 629 | 1 | |
| 626 | 1 | |
| 624.5 | 1 |
item_user_heavy_ratio
Real number (ℝ)
High correlation
| Distinct | 4644 |
|---|---|
| Distinct (%) | 68.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.46294929 |
| Minimum | 0.1261993 |
|---|---|
| Maximum | 0.90384615 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 53.3 KiB |
Quantile statistics
| Minimum | 0.1261993 |
|---|---|
| 5-th percentile | 0.26793334 |
| Q1 | 0.37662338 |
| median | 0.45849802 |
| Q3 | 0.55 |
| 95-th percentile | 0.66418033 |
| Maximum | 0.90384615 |
| Range | 0.77764685 |
| Interquartile range (IQR) | 0.17337662 |
Descriptive statistics
| Standard deviation | 0.12054376 |
|---|---|
| Coefficient of variation (CV) | 0.26038222 |
| Kurtosis | -0.30902095 |
| Mean | 0.46294929 |
| Median Absolute Deviation (MAD) | 0.085805774 |
| Skewness | 0.048192934 |
| Sum | 3151.2958 |
| Variance | 0.014530799 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.5 | 78 | 1.1% |
| 0.5714285714 | 32 | 0.5% |
| 0.4 | 25 | 0.4% |
| 0.6666666667 | 25 | 0.4% |
| 0.3333333333 | 21 | 0.3% |
| 0.4285714286 | 20 | 0.3% |
| 0.6 | 19 | 0.3% |
| 0.5555555556 | 18 | 0.3% |
| 0.4615384615 | 17 | 0.2% |
| 0.5833333333 | 16 | 0.2% |
| Other values (4634) | 6536 |
| Value | Count | Frequency (%) |
| 0.1261992995 | 1 | |
| 0.1299560666 | 1 | |
| 0.1332833497 | 1 | |
| 0.1339962641 | 1 | |
| 0.1348524879 | 1 | |
| 0.135464408 | 1 | |
| 0.1365273775 | 1 | |
| 0.1385244401 | 1 | |
| 0.1455991586 | 1 | |
| 0.146209691 | 1 |
| Value | Count | Frequency (%) |
| 0.9038461538 | 1 | |
| 0.84 | 1 | |
| 0.82 | 1 | |
| 0.8166666667 | 1 | |
| 0.8142857143 | 1 | |
| 0.8125 | 1 | |
| 0.8076923077 | 2 | |
| 0.8064516129 | 1 | |
| 0.8039215686 | 1 | |
| 0.8 | 1 |
n_genres
Real number (ℝ)
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.3406787 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 53.3 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.1140493 |
|---|---|
| Coefficient of variation (CV) | 0.47595139 |
| Kurtosis | 0.50515521 |
| Mean | 2.3406787 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.71163399 |
| Sum | 15933 |
| Variance | 1.2411058 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 2280 | |
| 3 | 1786 | |
| 1 | 1756 | |
| 4 | 726 | 10.7% |
| 5 | 212 | 3.1% |
| 6 | 38 | 0.6% |
| 7 | 7 | 0.1% |
| 8 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 1756 | |
| 2 | 2280 | |
| 3 | 1786 | |
| 4 | 726 | 10.7% |
| 5 | 212 | 3.1% |
| 6 | 38 | 0.6% |
| 7 | 7 | 0.1% |
| 8 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 10 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 7 | 7 | 0.1% |
| 6 | 38 | 0.6% |
| 5 | 212 | 3.1% |
| 4 | 726 | 10.7% |
| 3 | 1786 | |
| 2 | 2280 | |
| 1 | 1756 |
n_writers
Real number (ℝ)
Zeros
| Distinct | 21 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.6609373 |
| Minimum | 0 |
|---|---|
| Maximum | 24 |
| Zeros | 1159 |
| Zeros (%) | 17.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 53.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 4 |
| Maximum | 24 |
| Range | 24 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.5973385 |
|---|---|
| Coefficient of variation (CV) | 0.96170912 |
| Kurtosis | 31.058741 |
| Mean | 1.6609373 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 3.6935449 |
| Sum | 11306 |
| Variance | 2.5514903 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 2660 | |
| 2 | 1681 | |
| 0 | 1159 | |
| 3 | 699 | 10.3% |
| 4 | 323 | 4.7% |
| 5 | 137 | 2.0% |
| 6 | 67 | 1.0% |
| 7 | 26 | 0.4% |
| 8 | 17 | 0.2% |
| 9 | 11 | 0.2% |
| Other values (11) | 27 | 0.4% |
| Value | Count | Frequency (%) |
| 0 | 1159 | |
| 1 | 2660 | |
| 2 | 1681 | |
| 3 | 699 | 10.3% |
| 4 | 323 | 4.7% |
| 5 | 137 | 2.0% |
| 6 | 67 | 1.0% |
| 7 | 26 | 0.4% |
| 8 | 17 | 0.2% |
| 9 | 11 | 0.2% |
| Value | Count | Frequency (%) |
| 24 | 1 | < 0.1% |
| 23 | 1 | < 0.1% |
| 21 | 2 | |
| 19 | 2 | |
| 18 | 2 | |
| 17 | 1 | < 0.1% |
| 16 | 3 | |
| 15 | 1 | < 0.1% |
| 12 | 2 | |
| 11 | 4 |
n_directors
Real number (ℝ)
Zeros
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.86748935 |
| Minimum | 0 |
|---|---|
| Maximum | 14 |
| Zeros | 1304 |
| Zeros (%) | 19.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 53.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 14 |
| Range | 14 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.56653612 |
|---|---|
| Coefficient of variation (CV) | 0.65307559 |
| Kurtosis | 68.127255 |
| Mean | 0.86748935 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.9816966 |
| Sum | 5905 |
| Variance | 0.32096318 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 5220 | |
| 0 | 1304 | 19.2% |
| 2 | 225 | 3.3% |
| 3 | 37 | 0.5% |
| 4 | 8 | 0.1% |
| 6 | 4 | 0.1% |
| 7 | 3 | < 0.1% |
| 5 | 3 | < 0.1% |
| 10 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1304 | 19.2% |
| 1 | 5220 | |
| 2 | 225 | 3.3% |
| 3 | 37 | 0.5% |
| 4 | 8 | 0.1% |
| 5 | 3 | < 0.1% |
| 6 | 4 | 0.1% |
| 7 | 3 | < 0.1% |
| 8 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 14 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 7 | 3 | < 0.1% |
| 6 | 4 | 0.1% |
| 5 | 3 | < 0.1% |
| 4 | 8 | 0.1% |
| 3 | 37 | 0.5% |
| 2 | 225 | 3.3% |
| 1 | 5220 |
genre_combo
Unsupported
Rejected Unsupported
| Missing | 0 |
|---|---|
| Missing (%) | 0.0% |
| Memory size | 443.7 KiB |
Interactions
Correlations
| item | review_count | year | activity_span | activity_days | start_year | start_month | start_weekday | start_hour | end_year | end_month | end_weekday | end_hour | item_user_median_activity | item_user_heavy_ratio | n_genres | n_writers | n_directors | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| item | 1.000 | -0.132 | 0.512 | -0.943 | -0.943 | 0.940 | 0.447 | 0.259 | -0.005 | 0.065 | -0.039 | -0.005 | 0.030 | 0.143 | 0.185 | 0.044 | -0.079 | -0.093 |
| review_count | -0.132 | 1.000 | 0.039 | 0.154 | 0.154 | -0.134 | -0.049 | -0.238 | 0.079 | 0.124 | -0.061 | -0.221 | 0.123 | -0.466 | -0.559 | 0.162 | 0.149 | 0.144 |
| year | 0.512 | 0.039 | 1.000 | -0.449 | -0.449 | 0.452 | 0.224 | 0.087 | 0.012 | 0.085 | -0.070 | -0.010 | 0.084 | -0.188 | -0.150 | 0.058 | -0.158 | -0.115 |
| activity_span | -0.943 | 0.154 | -0.449 | 1.000 | 1.000 | -0.989 | -0.401 | -0.231 | 0.005 | 0.033 | -0.005 | 0.002 | -0.020 | -0.203 | -0.247 | -0.022 | 0.078 | 0.094 |
| activity_days | -0.943 | 0.154 | -0.449 | 1.000 | 1.000 | -0.989 | -0.401 | -0.231 | 0.006 | 0.033 | -0.005 | 0.002 | -0.020 | -0.204 | -0.247 | -0.022 | 0.078 | 0.094 |
| start_year | 0.940 | -0.134 | 0.452 | -0.989 | -0.989 | 1.000 | 0.325 | 0.214 | -0.001 | 0.076 | -0.041 | 0.001 | 0.026 | 0.180 | 0.224 | 0.032 | -0.067 | -0.086 |
| start_month | 0.447 | -0.049 | 0.224 | -0.401 | -0.401 | 0.325 | 1.000 | 0.186 | 0.011 | 0.007 | -0.008 | -0.036 | 0.016 | 0.055 | 0.067 | 0.027 | -0.026 | -0.023 |
| start_weekday | 0.259 | -0.238 | 0.087 | -0.231 | -0.231 | 0.214 | 0.186 | 1.000 | -0.071 | -0.050 | 0.032 | 0.034 | -0.013 | 0.164 | 0.193 | -0.016 | -0.060 | -0.053 |
| start_hour | -0.005 | 0.079 | 0.012 | 0.005 | 0.006 | -0.001 | 0.011 | -0.071 | 1.000 | 0.032 | -0.002 | -0.025 | 0.010 | -0.059 | -0.069 | 0.017 | 0.007 | 0.010 |
| end_year | 0.065 | 0.124 | 0.085 | 0.033 | 0.033 | 0.076 | 0.007 | -0.050 | 0.032 | 1.000 | -0.747 | 0.001 | 0.057 | -0.171 | -0.161 | 0.087 | 0.065 | 0.066 |
| end_month | -0.039 | -0.061 | -0.070 | -0.005 | -0.005 | -0.041 | -0.008 | 0.032 | -0.002 | -0.747 | 1.000 | 0.003 | -0.050 | 0.093 | 0.081 | -0.048 | -0.026 | -0.038 |
| end_weekday | -0.005 | -0.221 | -0.010 | 0.002 | 0.002 | 0.001 | -0.036 | 0.034 | -0.025 | 0.001 | 0.003 | 1.000 | -0.011 | 0.114 | 0.136 | -0.042 | -0.035 | -0.054 |
| end_hour | 0.030 | 0.123 | 0.084 | -0.020 | -0.020 | 0.026 | 0.016 | -0.013 | 0.010 | 0.057 | -0.050 | -0.011 | 1.000 | -0.113 | -0.120 | 0.034 | 0.047 | 0.030 |
| item_user_median_activity | 0.143 | -0.466 | -0.188 | -0.203 | -0.204 | 0.180 | 0.055 | 0.164 | -0.059 | -0.171 | 0.093 | 0.114 | -0.113 | 1.000 | 0.967 | -0.162 | -0.127 | -0.059 |
| item_user_heavy_ratio | 0.185 | -0.559 | -0.150 | -0.247 | -0.247 | 0.224 | 0.067 | 0.193 | -0.069 | -0.161 | 0.081 | 0.136 | -0.120 | 0.967 | 1.000 | -0.168 | -0.151 | -0.083 |
| n_genres | 0.044 | 0.162 | 0.058 | -0.022 | -0.022 | 0.032 | 0.027 | -0.016 | 0.017 | 0.087 | -0.048 | -0.042 | 0.034 | -0.162 | -0.168 | 1.000 | 0.225 | 0.102 |
| n_writers | -0.079 | 0.149 | -0.158 | 0.078 | 0.078 | -0.067 | -0.026 | -0.060 | 0.007 | 0.065 | -0.026 | -0.035 | 0.047 | -0.127 | -0.151 | 0.225 | 1.000 | 0.357 |
| n_directors | -0.093 | 0.144 | -0.115 | 0.094 | 0.094 | -0.086 | -0.023 | -0.053 | 0.010 | 0.066 | -0.038 | -0.054 | 0.030 | -0.059 | -0.083 | 0.102 | 0.357 | 1.000 |
| item | review_count | year | activity_span | activity_days | start_year | start_month | start_weekday | start_hour | end_year | end_month | end_weekday | end_hour | item_user_median_activity | item_user_heavy_ratio | n_genres | n_writers | n_directors | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| item | 1.000 | -0.223 | 0.677 | -0.791 | -0.791 | 0.828 | 0.348 | 0.275 | 0.001 | -0.003 | 0.003 | 0.002 | 0.011 | 0.184 | 0.199 | 0.036 | -0.128 | -0.139 |
| review_count | -0.223 | 1.000 | -0.014 | 0.493 | 0.493 | -0.186 | -0.109 | -0.236 | 0.064 | 0.368 | 0.120 | -0.127 | 0.105 | -0.602 | -0.621 | 0.214 | 0.215 | 0.243 |
| year | 0.677 | -0.014 | 1.000 | -0.573 | -0.573 | 0.687 | 0.272 | 0.169 | 0.006 | 0.112 | 0.018 | -0.005 | 0.081 | -0.064 | -0.050 | 0.059 | -0.166 | -0.124 |
| activity_span | -0.791 | 0.493 | -0.573 | 1.000 | 1.000 | -0.841 | -0.381 | -0.356 | 0.019 | 0.206 | 0.046 | -0.042 | 0.032 | -0.375 | -0.397 | 0.040 | 0.176 | 0.188 |
| activity_days | -0.791 | 0.493 | -0.573 | 1.000 | 1.000 | -0.841 | -0.381 | -0.356 | 0.020 | 0.206 | 0.046 | -0.041 | 0.032 | -0.375 | -0.397 | 0.040 | 0.176 | 0.188 |
| start_year | 0.828 | -0.186 | 0.687 | -0.841 | -0.841 | 1.000 | 0.290 | 0.235 | 0.011 | 0.050 | 0.017 | 0.006 | 0.031 | 0.176 | 0.196 | 0.047 | -0.110 | -0.125 |
| start_month | 0.348 | -0.109 | 0.272 | -0.381 | -0.381 | 0.290 | 1.000 | 0.203 | 0.008 | -0.023 | 0.013 | -0.023 | 0.012 | 0.069 | 0.077 | 0.008 | -0.033 | -0.036 |
| start_weekday | 0.275 | -0.236 | 0.169 | -0.356 | -0.356 | 0.235 | 0.203 | 1.000 | -0.064 | -0.060 | -0.012 | 0.039 | -0.021 | 0.178 | 0.190 | -0.017 | -0.059 | -0.061 |
| start_hour | 0.001 | 0.064 | 0.006 | 0.019 | 0.020 | 0.011 | 0.008 | -0.064 | 1.000 | 0.023 | 0.007 | -0.022 | 0.005 | -0.050 | -0.053 | 0.014 | 0.003 | 0.007 |
| end_year | -0.003 | 0.368 | 0.112 | 0.206 | 0.206 | 0.050 | -0.023 | -0.060 | 0.023 | 1.000 | -0.552 | 0.001 | 0.065 | -0.183 | -0.181 | 0.091 | 0.079 | 0.087 |
| end_month | 0.003 | 0.120 | 0.018 | 0.046 | 0.046 | 0.017 | 0.013 | -0.012 | 0.007 | -0.552 | 1.000 | 0.009 | -0.008 | -0.115 | -0.118 | 0.036 | 0.054 | 0.031 |
| end_weekday | 0.002 | -0.127 | -0.005 | -0.042 | -0.041 | 0.006 | -0.023 | 0.039 | -0.022 | 0.001 | 0.009 | 1.000 | 0.003 | 0.111 | 0.114 | -0.039 | -0.042 | -0.050 |
| end_hour | 0.011 | 0.105 | 0.081 | 0.032 | 0.032 | 0.031 | 0.012 | -0.021 | 0.005 | 0.065 | -0.008 | 0.003 | 1.000 | -0.109 | -0.111 | 0.025 | 0.036 | 0.033 |
| item_user_median_activity | 0.184 | -0.602 | -0.064 | -0.375 | -0.375 | 0.176 | 0.069 | 0.178 | -0.050 | -0.183 | -0.115 | 0.111 | -0.109 | 1.000 | 0.986 | -0.160 | -0.140 | -0.065 |
| item_user_heavy_ratio | 0.199 | -0.621 | -0.050 | -0.397 | -0.397 | 0.196 | 0.077 | 0.190 | -0.053 | -0.181 | -0.118 | 0.114 | -0.111 | 0.986 | 1.000 | -0.162 | -0.148 | -0.074 |
| n_genres | 0.036 | 0.214 | 0.059 | 0.040 | 0.040 | 0.047 | 0.008 | -0.017 | 0.014 | 0.091 | 0.036 | -0.039 | 0.025 | -0.160 | -0.162 | 1.000 | 0.197 | 0.098 |
| n_writers | -0.128 | 0.215 | -0.166 | 0.176 | 0.176 | -0.110 | -0.033 | -0.059 | 0.003 | 0.079 | 0.054 | -0.042 | 0.036 | -0.140 | -0.148 | 0.197 | 1.000 | 0.343 |
| n_directors | -0.139 | 0.243 | -0.124 | 0.188 | 0.188 | -0.125 | -0.036 | -0.061 | 0.007 | 0.087 | 0.031 | -0.050 | 0.033 | -0.065 | -0.074 | 0.098 | 0.343 | 1.000 |
| activity_days | end_hour | end_month | end_weekday | end_year | item | item_user_heavy_ratio | item_user_median_activity | n_directors | n_genres | n_writers | review_count | start_hour | start_month | start_weekday | start_year | year | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| activity_days | 1.000 | 0.032 | 0.046 | -0.041 | 0.206 | -0.791 | -0.397 | -0.375 | 0.188 | 0.040 | 0.176 | 0.493 | 0.020 | -0.381 | -0.356 | -0.841 | -0.573 |
| end_hour | 0.032 | 1.000 | -0.008 | 0.003 | 0.065 | 0.011 | -0.111 | -0.109 | 0.033 | 0.025 | 0.036 | 0.105 | 0.005 | 0.012 | -0.021 | 0.031 | 0.081 |
| end_month | 0.046 | -0.008 | 1.000 | 0.009 | -0.552 | 0.003 | -0.118 | -0.115 | 0.031 | 0.036 | 0.054 | 0.120 | 0.007 | 0.013 | -0.012 | 0.017 | 0.018 |
| end_weekday | -0.041 | 0.003 | 0.009 | 1.000 | 0.001 | 0.002 | 0.114 | 0.111 | -0.050 | -0.039 | -0.042 | -0.127 | -0.022 | -0.023 | 0.039 | 0.006 | -0.005 |
| end_year | 0.206 | 0.065 | -0.552 | 0.001 | 1.000 | -0.003 | -0.181 | -0.183 | 0.087 | 0.091 | 0.079 | 0.368 | 0.023 | -0.023 | -0.060 | 0.050 | 0.112 |
| item | -0.791 | 0.011 | 0.003 | 0.002 | -0.003 | 1.000 | 0.199 | 0.184 | -0.139 | 0.036 | -0.128 | -0.223 | 0.001 | 0.348 | 0.275 | 0.828 | 0.677 |
| item_user_heavy_ratio | -0.397 | -0.111 | -0.118 | 0.114 | -0.181 | 0.199 | 1.000 | 0.986 | -0.074 | -0.162 | -0.148 | -0.621 | -0.053 | 0.077 | 0.190 | 0.196 | -0.050 |
| item_user_median_activity | -0.375 | -0.109 | -0.115 | 0.111 | -0.183 | 0.184 | 0.986 | 1.000 | -0.065 | -0.160 | -0.140 | -0.602 | -0.050 | 0.069 | 0.178 | 0.176 | -0.064 |
| n_directors | 0.188 | 0.033 | 0.031 | -0.050 | 0.087 | -0.139 | -0.074 | -0.065 | 1.000 | 0.098 | 0.343 | 0.243 | 0.007 | -0.036 | -0.061 | -0.125 | -0.124 |
| n_genres | 0.040 | 0.025 | 0.036 | -0.039 | 0.091 | 0.036 | -0.162 | -0.160 | 0.098 | 1.000 | 0.197 | 0.214 | 0.014 | 0.008 | -0.017 | 0.047 | 0.059 |
| n_writers | 0.176 | 0.036 | 0.054 | -0.042 | 0.079 | -0.128 | -0.148 | -0.140 | 0.343 | 0.197 | 1.000 | 0.215 | 0.003 | -0.033 | -0.059 | -0.110 | -0.166 |
| review_count | 0.493 | 0.105 | 0.120 | -0.127 | 0.368 | -0.223 | -0.621 | -0.602 | 0.243 | 0.214 | 0.215 | 1.000 | 0.064 | -0.109 | -0.236 | -0.186 | -0.014 |
| start_hour | 0.020 | 0.005 | 0.007 | -0.022 | 0.023 | 0.001 | -0.053 | -0.050 | 0.007 | 0.014 | 0.003 | 0.064 | 1.000 | 0.008 | -0.064 | 0.011 | 0.006 |
| start_month | -0.381 | 0.012 | 0.013 | -0.023 | -0.023 | 0.348 | 0.077 | 0.069 | -0.036 | 0.008 | -0.033 | -0.109 | 0.008 | 1.000 | 0.203 | 0.290 | 0.272 |
| start_weekday | -0.356 | -0.021 | -0.012 | 0.039 | -0.060 | 0.275 | 0.190 | 0.178 | -0.061 | -0.017 | -0.059 | -0.236 | -0.064 | 0.203 | 1.000 | 0.235 | 0.169 |
| start_year | -0.841 | 0.031 | 0.017 | 0.006 | 0.050 | 0.828 | 0.196 | 0.176 | -0.125 | 0.047 | -0.110 | -0.186 | 0.011 | 0.290 | 0.235 | 1.000 | 0.687 |
| year | -0.573 | 0.081 | 0.018 | -0.005 | 0.112 | 0.677 | -0.050 | -0.064 | -0.124 | 0.059 | -0.166 | -0.014 | 0.006 | 0.272 | 0.169 | 0.687 | 1.000 |
Missing values
Sample
| item | review_count | genres | directors | writers | year | title | start_datetime | end_datetime | activity_span | activity_days | start_year | start_month | start_weekday | start_hour | end_year | end_month | end_weekday | end_hour | item_user_median_activity | item_user_heavy_ratio | n_genres | n_writers | n_directors | genre_combo | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 12217 | [Adventure, Animation, Children, Comedy, Fantasy] | [nm0005124] | [nm0004056, nm0005124, nm0169505, nm0230032, nm0710020, nm0812513, nm0923736] | 1995.0 | Toy Story (1995) | 2005-04-11 12:41:27 | 2015-03-30 19:57:57 | 3640 days 07:16:30 | 3640 | 2005 | 4 | 0 | 12 | 2015 | 3 | 0 | 19 | 157.0 | 0.166244 | 5 | 7 | 1 | (Adventure, Animation, Children, Comedy, Fantasy) |
| 1 | 2 | 3364 | [Adventure, Children, Fantasy] | [nm0002653] | [nm0378144, nm0852430, nm0885575] | 1995.0 | Jumanji (1995) | 2005-04-13 07:34:58 | 2015-03-30 12:31:07 | 3638 days 04:56:09 | 3638 | 2005 | 4 | 2 | 7 | 2015 | 3 | 0 | 12 | 216.5 | 0.263080 | 3 | 3 | 1 | (Adventure, Children, Fantasy) |
| 2 | 3 | 734 | [Comedy, Romance] | [nm0222043] | [nm0425756] | 1995.0 | Grumpier Old Men (1995) | 2005-04-11 17:55:49 | 2015-03-26 18:25:06 | 3636 days 00:29:17 | 3636 | 2005 | 4 | 0 | 17 | 2015 | 3 | 3 | 18 | 282.0 | 0.388283 | 2 | 1 | 1 | (Comedy, Romance) |
| 3 | 4 | 43 | [Comedy, Drama, Romance] | [nm0001845] | [nm0060103] | 1995.0 | Waiting to Exhale (1995) | 2005-05-09 10:54:42 | 2015-03-22 21:45:09 | 3604 days 10:50:27 | 3604 | 2005 | 5 | 0 | 10 | 2015 | 3 | 6 | 21 | 376.0 | 0.581395 | 3 | 1 | 1 | (Comedy, Drama, Romance) |
| 4 | 5 | 590 | [Comedy] | [nm0796124] | [nm0329304, nm0352443, nm0583600, nm0796124] | 1995.0 | Father of the Bride Part II (1995) | 2005-04-13 16:16:13 | 2015-03-22 21:52:02 | 3630 days 05:35:49 | 3630 | 2005 | 4 | 2 | 16 | 2015 | 3 | 6 | 21 | 240.0 | 0.335593 | 1 | 4 | 1 | (Comedy,) |
| 5 | 6 | 5124 | [Action, Crime, Thriller] | [nm0000520] | [nm0000520] | 1995.0 | Heat (1995) | 2005-04-12 05:22:29 | 2015-03-30 19:25:46 | 3639 days 14:03:17 | 3639 | 2005 | 4 | 1 | 5 | 2015 | 3 | 0 | 19 | 217.0 | 0.274980 | 3 | 1 | 1 | (Action, Crime, Thriller) |
| 6 | 7 | 799 | [Comedy, Romance] | [nm0001628] | [nm0000697, nm0070660, nm0499626, nm0713128, nm0853138] | 1995.0 | Sabrina (1995) | 2005-04-12 20:51:48 | 2015-03-28 04:29:48 | 3636 days 07:38:00 | 3636 | 2005 | 4 | 1 | 20 | 2015 | 3 | 5 | 4 | 239.0 | 0.336671 | 2 | 5 | 1 | (Comedy, Romance) |
| 7 | 8 | 48 | [Adventure, Children] | [nm0382072] | [nm0521739, nm0814085, nm0878494] | 1995.0 | Tom and Huck (1995) | 2005-04-19 17:02:43 | 2015-03-01 17:00:55 | 3602 days 23:58:12 | 3602 | 2005 | 4 | 1 | 17 | 2015 | 3 | 6 | 17 | 355.0 | 0.541667 | 2 | 3 | 1 | (Adventure, Children) |
| 8 | 9 | 66 | [Action] | [nm0001382] | [nm0704164] | 1995.0 | Sudden Death (1995) | 2005-04-19 00:19:14 | 2014-12-29 15:48:41 | 3541 days 15:29:27 | 3541 | 2005 | 4 | 1 | 0 | 2014 | 12 | 0 | 15 | 434.0 | 0.606061 | 1 | 1 | 1 | (Action,) |
| 9 | 10 | 4286 | [Action, Adventure, Thriller] | [nm0132709] | [nm0001220, nm0128997, nm0270761, nm0289833] | 1995.0 | GoldenEye (1995) | 2005-04-11 13:47:25 | 2015-03-30 00:21:39 | 3639 days 10:34:14 | 3639 | 2005 | 4 | 0 | 13 | 2015 | 3 | 0 | 0 | 201.0 | 0.250583 | 3 | 4 | 1 | (Action, Adventure, Thriller) |
| item | review_count | genres | directors | writers | year | title | start_datetime | end_datetime | activity_span | activity_days | start_year | start_month | start_weekday | start_hour | end_year | end_month | end_weekday | end_hour | item_user_median_activity | item_user_heavy_ratio | n_genres | n_writers | n_directors | genre_combo | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 6797 | 116823 | 276 | [Adventure, Sci-Fi, Thriller] | [nm1349376] | [nm0185976, nm0834960, nm1056741] | 2014.0 | The Hunger Games: Mockingjay - Part 1 (2014) | 2014-11-13 21:41:34 | 2015-03-30 18:14:30 | 136 days 20:32:56 | 136 | 2014 | 11 | 3 | 21 | 2015 | 3 | 0 | 18 | 243.0 | 0.398551 | 3 | 3 | 1 | (Adventure, Sci-Fi, Thriller) |
| 6798 | 117176 | 188 | [Drama, Romance] | [nm1016428] | NaN | 2014.0 | Theory of Everything, The (2014) | 2014-11-19 00:10:40 | 2015-03-30 18:18:10 | 131 days 18:07:30 | 131 | 2014 | 11 | 2 | 0 | 2015 | 3 | 0 | 18 | 323.0 | 0.478723 | 2 | 0 | 1 | (Drama, Romance) |
| 6799 | 117533 | 44 | [Documentary] | NaN | NaN | 2014.0 | Citizenfour (2014) | 2014-11-27 14:40:19 | 2015-03-27 16:30:49 | 120 days 01:50:30 | 120 | 2014 | 11 | 3 | 14 | 2015 | 3 | 4 | 16 | 248.0 | 0.409091 | 1 | 0 | 0 | (Documentary,) |
| 6800 | 117881 | 38 | [Drama] | [nm0322144, nm0922903] | [nm0322144, nm0922903] | 2014.0 | Still Alice (2014) | 2014-12-05 14:57:05 | 2015-03-27 16:48:43 | 112 days 01:51:38 | 112 | 2014 | 12 | 4 | 14 | 2015 | 3 | 4 | 16 | 388.5 | 0.526316 | 1 | 2 | 2 | (Drama,) |
| 6801 | 118696 | 232 | [Adventure, Fantasy] | [nm0001392] | [nm0001392, nm0101991, nm0866058, nm0868219, nm0909638] | 2014.0 | The Hobbit: The Battle of the Five Armies (2014) | 2014-12-10 20:00:39 | 2015-03-29 22:48:33 | 109 days 02:47:54 | 109 | 2014 | 12 | 2 | 20 | 2015 | 3 | 6 | 22 | 309.0 | 0.465517 | 2 | 5 | 1 | (Adventure, Fantasy) |
| 6802 | 118700 | 54 | [Drama] | NaN | NaN | 2014.0 | Selma (2014) | 2015-01-03 05:32:06 | 2015-03-29 19:32:30 | 85 days 14:00:24 | 85 | 2015 | 1 | 5 | 5 | 2015 | 3 | 6 | 19 | 430.5 | 0.629630 | 1 | 0 | 0 | (Drama,) |
| 6803 | 118900 | 60 | [Drama] | [nm0885249] | [nm0394984] | 2014.0 | Wild (2014) | 2014-12-15 12:46:26 | 2015-03-28 09:46:33 | 102 days 21:00:07 | 102 | 2014 | 12 | 0 | 12 | 2015 | 3 | 5 | 9 | 492.0 | 0.683333 | 1 | 1 | 1 | (Drama,) |
| 6804 | 118997 | 52 | [Children, Comedy, Fantasy, Musical] | [nm0551128] | [nm0487567] | 2014.0 | Into the Woods (2014) | 2014-12-26 01:10:34 | 2015-03-27 15:34:34 | 91 days 14:24:00 | 91 | 2014 | 12 | 4 | 1 | 2015 | 3 | 4 | 15 | 396.0 | 0.596154 | 4 | 1 | 1 | (Children, Comedy, Fantasy, Musical) |
| 6805 | 119141 | 122 | [Action, Comedy] | [nm0736622, nm1698571] | [nm0736622, nm1698571] | 2014.0 | The Interview (2014) | 2014-12-25 00:22:55 | 2015-03-30 03:46:08 | 95 days 03:23:13 | 95 | 2014 | 12 | 3 | 0 | 2015 | 3 | 0 | 3 | 336.5 | 0.500000 | 2 | 2 | 2 | (Action, Comedy) |
| 6806 | 119145 | 78 | [Action, Adventure, Comedy, Crime] | [nm0891216] | [nm0891216, nm0963359, nm1733301, nm2092839] | NaN | Kingsman: The Secret Service (2015) | 2014-12-19 21:02:55 | 2015-03-30 19:58:21 | 100 days 22:55:26 | 100 | 2014 | 12 | 4 | 21 | 2015 | 3 | 0 | 19 | 349.0 | 0.551282 | 4 | 4 | 1 | (Action, Adventure, Comedy, Crime) |